Adsorption of a random heteropolymer at a potential well 
revisited: location of transition point and design of sequences 



Alexei Naidenov 1 and Sergei Nechaev 1,2 

l L D Landau Institute for Theoretical Physics, 117940, Moscow, Russia 

2 UMR 8626, CNRS-Universite Paris XI, LPTMS, Bat.100, Universite Paris Sua 1 , 

91405 Orsay Cedex, France 

The adsorption of an ideal heteropolymer loop at a potential point well 
is investigated within the frameworks of a standard random matrix theory. 
On the basis of semi-analytical/semi-numerical approach the histogram of 
transition points for the ensemble of quenched heteropolymer structures with 
bimodal symmetric distribution of types of chain's links is constructed. It 
is shown that the sequences having the transition points in the tail of the 
histogram display the correlations between nearest-neighbor monomers. 

I. INTRODUCTION 

The problem of adsorption of an ideal heteropolymer chain with random quenched chem- 
ical structure (i.e. sequence of links) is far from being new and is well represented in the 
literature. 



The simple diffusive approach provides us complete understanding of the homopoly- 
mer and block-copolymers adsorption in various geometries. More advanced renormalization 
group methods []5],|B| and power series analysis J7| applied to random chains with a disordered 
sequence of links delivers an important information about the thermodynamic properties of 
ideal polymers near the transition point from delocalized (Gaussian) to localized (adsorbed) 
regimes. A mathematical formalism based on perturbation theory gives reasonable results 
for a phase transition in solid-on-solid (SOS) models with quenched impurities ||. Many 
conclusions of || correlate with those obtained by RG analysis in [[J . Problems dealing with 
localization of disordered polymers at selective interfaces [PHTTH should be also mentioned in 
the context of discussed problems. 

However even since early 80s during almost 20 years of intent attention some important 
questions of heteropolymer adsorption remain still open. One of them, the most intriguing, 
to our point of view, concerns the location of a point of a phase transition from coil to 
localized (adsorbed at a potential well) state. 

In this paper we consider the model of an ideal, i.e. self-intersecting ring polymer 
chain consisting of two types of links, "black" and "white" organized in different chemical 
structures having different energies in the single point-like potential well. We distinguish 
between the following models of chemical structures: 

1. " Canonically quenched" sequence — positions of "black" and "white" links are uniquely 
fixed for a given chain structure and cannot exchange in course of chain fluctuations; 



1 



the chemical structure is prepared from "black" and "white" links at random with 
prescribed probabilities. 

2. " Microcanonically quenched" sequence — positions of "black" and "white" links are 
again uniquely fixed for a given chain structure and satisfy an extra constraint: the 
total number of "black" and "white" links per chain is conserved for all realizations of 
chain structures. 

Our consideration is semi-analytical/semi- numerical. Namely, the closure of an ideal 
iV-link chain in a ring enables us to integrate from the very beginning over all space degrees 
of freedom for quenched sequences of links and explicitly rewrite the Green's function of a 
chain as a quotient of determinants of two N x iV-matrices with random coefficients on the 
main diagonals. The transition point corresponds to the situation when the denominator of 
the quotient becomes zero. This last question for large N and different models of chemical 
structures is analyzed numerically. 

The results are exposed in a form of histograms showing how many sequences have the 
transition points at a given energy. Choosing the sequences which have the transition points 
in the tail of the histogram we found that these sequences have effective correlations between 
nearest-neighbor monomers for the model (2) and do not exhibit any correlations for the 
model (1). 



Consider an iV-link ideal (i.e. self-intersecting) Gaussian polymer chain in <i-dimensional 
space. Suppose that the chain forms a ring and attach the first and the last chain's segments 
to the origin. All segments are painted in "black" or "white" colors, designating different 
interaction energies with the potential well located at the origin. There is no interaction of 
chain's links between themselves or between a link and any point in (i-dimensional space 
distinct from the origin. Increasing the interaction energy, we could provoke the transition 
from extended (coil) to localized (adsorbed) state of the macromolecule. The transition is 
governed by the interplay between entropic forces tending to expel the chain and contact 
potential interactions which gain the energy in a compact polymer configuration. 

The said above can be easily rewritten in a formal way. Let us introduce the joint distri- 
bution function Gn(x\, X2, xjv) having sense of a probability that the links with numbers 
1, 2, N of the iV-step chain are located at the points xi, x 2 , xjv in d-dimensional space. 
The Markov structure of our polymer chain ensures the presence of correlation only between 
the neighboring segments of length a and allows us to set 

Pjv(Xi, X 2 , Xjv) = £/(Xi, X 2 )#(x 2 , X 3 )...£/(x iV -l, Xjv) 

where the transition probability gfei, x 2 ) satisfies the normalization condition / <?(x, x')<ix = 
1 and for simplicity is supposed to be Gaussian: 



II. THE MODEL 




(1) 
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The Green's function G(x.n, N) = J Gjv(xi,x 2 , XAr)<ixi(ix 2 ...(ixAr_ 1 is defined as follows 



r N 

G(x N ,N) = J J] [g(xi-i,Xi)e-^ T ] d d Xl ...d d x N ^. (2) 

i=i 

where e _< ^ Xi ' )//T is the Boltzmann weight of i th chain segment in the potential well at the 
origin (compare to ||): 

p -0(x 4 )/t , , a r/„ \ R — \ e ~ Uh ~ 1 if the se g ment * is "black" , , 

t- PiO Pi i e „ Uw _ x . f gegment i ig » white » W 

and Mb, w is the dimensionless energy (everywhere below we set T = 1). 

Now we can write the recursion equation for the Green's function (0), expressing 
G(xj +1 ,z + l) in terms of G(xj,i) and corresponding Boltzman weights for < i < N. 
Performing Fourier transform and taking into account (^) we arrive at the following integral 
equation in the momentum space 

G(k, % + 1) = 0(k)G(k, i) + (2n)- d [3 l+1 J g(k')G(k', i)d d k' (4) 



where 

g(k) = e -l k l 2ffl2 / 2d (5) 



In order to make the system of equations (^) closed and nonuniform we should take into 
account the initial and boundary conditions. Let us point out that for open chains the 
precise form of initial and boundary conditions does not affect in the thermodynamic limit 
the phase transition point (see, for example, ||12|| ). The same we expect also for closed chains. 
Suppose that the last segment (with i = N) is linked to the origin by some sufficiently weak 
dimensionless potential /. Without the loss of generality we may chose / in the simplest 
form: /(k) = / = const. The presence of the potential / increases the probability G(k, N) 
to find the iV's chain link near the origin. If |/| 1 we may set in the linear approximation: 

G(k, i = N) = G(k, i = N) + f (6) 



Using the fact that the chain is closed, we complete equations (|J) by the following one: 

G(k, 2 = 1)= g(k)G(k, i = N) + (2n)- d (3 1 J g(k')G(k, i = N)d d k' (7) 

(as we shall see later / does not enter the final expression for the transition point). 

The system of iV equations (|])-(0) is closed. This enables us to write it as a sin- 
gle matrix-integral equation for the ^-component "spinor" Green's function G(k) = 
{G(k, 1), G(k, 2), G(k, N — 1), G(k, N)}. Hence we get: 

G(k) = M#(k)G(k) + (2n)- d (3M J g{k')G{k')d d k' + F(k) (8) 
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and 



F(k) = ((^(k) + (rf/27ra 2 )i)/,0,...,0 



(9) 



(10) 



Let us rearrange the terms in (§) as follows 

G(k) = (/-^(k)M)" 1 [{27c)- d f3M J g{k')G{k')d d k' + F{k) 



(11) 



where I is N x N identity matrix. Integrating l.h.s and r.h.s of ([Tl|) with the weight g(k) 
over all k, we arrive at the algebraic matrix equation for the function A = {Ai, A%, An}, 
where A { = J g(k)G(k,i)d d k: 



A = (2tt)- 



g(k) [I - g{k)M) j3M d d k 



A + F 



(12) 



The straightforward computations show that the N x N matrix C 
has the form: 



I - g(k)M) (3M 





(foci 


P3C2 P4C3 ■ 


• PlCN ^ 




P2CN 


03 c l Pa c 2 ■ 


■ PlCN-l 


6 = 


P2CN-I 


p 3 c N faa . 


■ PlCN-2 




\ P2C2 


P 3 c 3 /5 4 c 4 • 


■ PlCl ) 



with the coefficients 



Ci = Ci(k) 



g N ~ l (k) 
'^(k)-l' 



ie[l,N] 



and the vector F = (Fx, F 2 , . . . , Fn) t has components 

./ 



Fi = -ty d J (g(k) + (d/2na 2 )i) c N - l+ i(k)d d k 



(13) 



(14) 



(15) 



Hence the vector A obeys the system of equations 



4 



(At 


= (3 2 a>iAi 


+ (3 3 a 2 A 2 


+ . 


. + (3ncin~iAn-i 


+ (3ia N A N 


+ 


Ft 


A 2 


= i5 2 a N A x 


+ p 3 a ± A 2 


+ . 


. + (3nCIN-2An-1 


+ PiCLn-iAn 


+ 


F 2 


A 3 


= (3 2 a N -iAi 


+ p 3 a N A 2 


+ . 


. + PnO-nsAn-i 


+ PiO-N-2An 


+ 


F 3 


A N -! 


= (3 2 a 3 Ai 


+ P3a 4 A 2 


+ . 


. + (3n<1iAn-i 


+ (3ia 2 A N 


+ 


Fn 


, A N 


= (3 2 a 2 A x 


+ (3 3 a 3 A 2 


+ . 


. + [3ncinAn-i 


+ (3iaiA N 


+ 


Fn 



(16) 



where the coefficients a,- for 1 < % < N are defined as follows 



(H = (2n)- d I g(k)c t (k)d d k 



d 



d/2 



2na 2 N 



'd N+l-i 
2' N 



(17) 



Solving the corresponding system of linear equations via the standard Kramers method, we 
arrive at the following expression for the values P^iAf 



detA'(i) 

f3 i+1 Ai = — — i-L, i G [1, N]; (by definition (3 N +i = Pi) 
detA 



(18) 



where the N x N matrix A reads 



.4 



/ ai - (3 2 1 a 2 a 3 
cln &1 — /3 3 1 ol 2 

cln—i Q>n a i — P4 1 



A.3 

V a 2 



a 4 
a 3 



a 5 

0,1 



ajv-i a>N 

%-2 ClN-1 

a N-3 a N-2 

ax - (3 N l a 2 

a N ai - fa 1 J 



(19) 



and the N x N matrix A'{i) is obtained from the matrix A by replacing the i th column by 
the vector — F. 

Equations flTBp-fll^D enable us to rewrite N linearly independent components of the 
"spinor" G(k) = {Gi(k), G a (k), G N (k)} (see (0)) in a compact form 

-l N 

Gi(k) = (27r)- d [deti] ^ Ci _ m (k)dea'0-) + (I - g^M)- 1 ^) (20) 

3=1 

where Cj_ i+ i(k) = Cjv-y-i+i)(k) if j - i + 1 < 0. 

The transition point from delocalized (coil) to the adsorbed (globule) state of a het- 
eropolymer ring is manifested in the divergence of the Green's function Gi(k) (|20| ) for any 
% E [1,N]. Thus the transition point is determined by the equation 

deti = (21) 



where the matrix A is given by (|T9"D. Let us pay attention to the fact that equation (BTj) 
is very general: it is valid for any type of disorder and number of species (i.e. sorts of the 
links) . 
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For any finite chain lengths we can define only a transition region which due to the 
supposed self-averaging becomes sharper and sharper as the chain length increases, tending 
to a single point in the thermodynamic limit iV — > oo. 

Before passing to numerical solution of (|2T| ) for different randomly generated sequences 
belonging to canonically- and microcanonically quenched chemical structures, let us derive 
analytically the transition point for the effective homopolymer ring chain. In this case 
one can preaverage the partition function of the chain over the distribution of " black" and 
"white" links. Let us take = u, u w = —u and suppose that the number of "black" links 
is equal to the number of "white" ones. Then 

A = P av = 6 " 2 +e " - 1 = coshw - 1 (22) 

for all 1 < i < N. 

Remind that for iV-link open homopolymer chain attached by one end at the origin in 
a potential well, the critical value /3 ope n of a transition point in <i-dimensional space in the 
limit N — > oo reads IBIJll 



/r 1 



open 




0.86188 (23) 



The transition point of a closed chain can be analytically evaluated in the effective 
homopolymer case fl22|) . If all are equal, then the matrix A (see eq.(0)) is so-called 
circulant and its eigenvalues are [|L| 



A, 



TV 

E 

i=i 



i2*m(J-l)/N 



a^e 



me[0,N-l] 



(24) 



Eq. fl21"D is true when /3 1 equals to one of the eigenvalues X m that must be real. Thus 



TV / 

fa 1 = E% = 

3=1 \ 



d \ 



rf / 2 TV oo 



2na 2 N 
d/2 N _ x 



/ j \ a /^ N-l oo 

UL) EE 



\27ra 2 ) ^^{Nk-iyi* 



) SS(A: + l + (l-^)/iV)^ 

/ d V' 2 ™ 1 
\2ita?) f^f> 



(25) 



d 



d/2 



2vra 2 



c 



'd 



0' 1 

reopen 



Hence, in 3D-space for a = 1 we have 

arccosh[/3 av + 1] = arccosh 



tr 



2nJ S V2 



+ 1 



1.405 



Let us pay attention to the fact that the adsorption points of ring and open chains 
coincide in the thermodynamic limit. This is consistent with the statement on independence 
of the point of 2nd order phase transition on the boundary conditions jnj. Moreover, the 
interesting feature of (|25| ) consists in the fact that the value /3 av does not depend on N for 
ring chains. 
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III. NUMERICAL RESULTS FOR QUENCHED SEQUENCES OF LINKS. 



In this section we solve numerically Eq.(|2l|) for two different types of bimodal distribution 
of links with u\ } = u, u w = —u\ (1) canonically quenched and (2) microcanonically quenched. 
Recall that in case (1) "white" and "black" links appear in the sequence independently with 
equal probability ~; while in case (2) the total number of "black" and "white" links per 
chain is exactly y. We have generated ensembles of 10 000 chains of N segments each for 
both of models (1) and (2). The distribution of transition points W(u) in cases (a) and (b) 
is depicted in fig.|I] for N = 100. 



800 



600 - 



CO 

I 400 - 
CD 



CD 
-D 

E 

? 200 



quenched canonical distribution of links 



Gaussian fit: W c (u tr 



1.2 1.4 1.6 1.8 

transition point, u tr 
(a) 



1000 



800 



quenched microcanonical distribution of links 
Gaussian fit: W m (u, r ) 




1.6 1.7 

transition point, u tr 
(b) 



1.9 



FIG. 1. Distribution of transition points for (a) canonical and (b) microcanonical sequences of 
links. 



The shape of histograms in fig.[l]a,b are correspondingly fitted by Gaussian curves 

("tr-Kr)) 2 (»tr-«)) 2 

W c (u tr ) = A c e . w m (u t r) = A m e ; 

For = 100 the parameters A c ' m , (ul^) follows: 

' A c = 644.78 f A m = 908.49 

< « r ) = 1.756 I (vg) = 1.778 
a c = 0.116 [a m = 0.039 

To predict the numerical values of (tt£ r ) and (u™) at iV — >• oo we have studied numerically 
the dependencies (u^ r (N)) and (u™(N)) for different N. The results of our simulations are 
displayed in fig.^a, where the chain lengths iV have varied in the interval 80 < iV < 400. 
The fitting curves are as follows 

«(A0> = « c oo + u\e- N ^- «(i\r)> = u» + ufe- N '< 
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where 
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FIG. 2. Dependence of (a) mean value of transition point and (b) width of the histogram on 
chain length. 



The crucial question concerns the behavior of the width of histograms (see fig.[3]) in the 
thermodynamic limit iV — > oo. We have analyzed numerically the behavior a c ' m (N) for 
chains of length N in the interval 80 < iV < 360 and found no evident saturation of cr c,m at 
N ^> 1. The numerical data and corresponding power-law fits are shown in fig.FJb, where 

a c (N) = (Tq N~ aC a m {N) = a™ 

and the numerical values are as follows: 

(Tq = 1.331 a c = 0.52 for canoncal distribution 
cr™ = 0.143 a m = 0.285 for microcanoncal distribution 



IV. DISCUSSION 



A. Location of transition points in thermodynamic limit 



The results of our numerical simulations of eq. (^l|) permit us to conclude that within the 
standard error the mean values of phase transition energies (w£ r ) and (w™) in chains with 
symmetric bimodal canonically and microcanonically quenched sequences tend to one and 
the same value (u£ r ) = (w™) = Woo ~ 1-75 ± 0.02 in the limit iV — > oo. At the same time 
the transition point in an effective homopolymer chain defined by Eq . (p2|) is = 1.405. 
These numerical results seem to be sufficiently strong to conjecture the difference between 
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the mean value of transition point for ensembles of heteropolymers with quenched sequences 
of links and the transition point in an effective homopolymer chain. However one should 
honestly say that the expressed conjecture deserves more investigation and still cannot be 
considered as a rigorous statement. The reason why we cannot regard it yet as an exact one 
is as follows. 

1. We have analyzed the distribution of transition points in sufficiently long but finite 
chains (N ~ 80 -f- 360). Denote by Aufr n (N) the transition region for finite N. As we know 



from the theory of 2nd order phase transitions [12,14 



Aw fin (iV) ~ N~ 1/2 (26) 

as N — > oo due to the finiteness of N for homopolymer chains. 

2. The quenched randomness in types of links leads to an extra uncertainty in the location 
of a transition point for heteropolymer chains. The transition region Au ran (N) due to the 
randomness in sequences of links can be estimated by the width of the histograms — see figfj]. 
So, we may set 

^ , , f N~ aC for canonically quenched sequences , , 

r } N~ am for microcanonically quenched sequences 



The behavior (27) is consistent with general self-averaging hypothesis for disordered 
systems [[15[ claiming the existence of unique transition point for every disordered sequence 
in the thermodynamic limit N — > oo, i.e. 







Considering fl26|) and (P7|) as independent contributions we arrive at the statement on 
the difference between Uoo and in the limit N — > oo. However one cannot exclude 
the possible "interference" between (|26f) and (p7|) which may lead to the smearing of the 
transition region. This last possibility is not analyzed yet. Moreover, the asymptotics fl2"7|) 
is based on the interpolation of numerical results for finite iV to iV — > oo. The importance 
of ( pT|) deserves an analytic support of our conclusion. 



B. Design of sequences 



The data of our numerical simulations stored in course of the construction of the his- 
tograms (see fig.|I|) allow us to regard the following problem. Let us define the correlation 
function \ for each ring chain 

Y N-l 

X = T7 E °3 ( 28 ) 
JV 3=1 
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where the sum runs along all the chain's segments j = [1, N] and 




+ 1 for "black" segment 
— 1 for "white" segment 



Knowing the transition point (the solution of Eq. (PT|) ) for every of 10 000 sequences (for 
both of models (1) and (2)), let us assign to each sequence the point in the coordinates 
(u,x{u)) on the plane. The corresponding "spots" for the models (1) and (2) are shown in 
fig.^a,b for N = 300. These figures are lead to the following important conclusions. 

canonocal distribution of links microcanonical distribution of links 

100 



50 - 



^ - 



-50 - 
-100 - 

1.4 1.6 1.8 2 2. 

(a) (b) 

FIG. 3. The two-dimensional set (u, x( u )) f° r ( a ) canonically and (b) micro canonically quenched 
sequences; N = 300. 

1. There are no essential visible correlations between nearest-neighbor links in the ensem- 
ble of canonically quenched sequences what is manifested in the fact that the corresponding 
"spot" is circular — see fig|3]a. 

2. There are correlations between nearest-neighbor links in the ensemble of microcanon- 
ically quenched sequences what is manifested in the fact that the corresponding "spot" is 
asymmetric. Moreover, the figure |3|b allows to conclude how the primary structure of the 
heteropolymer chain is organized: the positive correlations (x{u)) > for u < (u m ) show the 
preferable clustering in the subsequences of the same color, while the negative correlations 
(x{ u )) < f° r u > (u m ) show the preferable mixing of opposite colors. 

Thus, knowing the value of the transition point for the chain of "microcanonically 
quenched ensemble" we can make an approximate conclusion about the primary sequence 
of monomers in a given chain. The fact that the width of the histogram shrinks to zero 
at N — > oo means that apparently at iV — » oo the corresponding "spot" in fig.f|b becomes 
more and more symmetric and the nontrivial correlations disappear in the thermodynamic 
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limit. However every real physical polymer system consists of finite number of monomers 
what means that the above mentioned effect should in principle exist. 

We expect that the approach proposed in the present work turns the initial problem into 
well-posed subject of statistical theory of random matrices, what would allow to apply the 
standard methods of random matrix theory for heteropolymer adsorption. 
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